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ABSTRACT 

This  thesis  develops  equations,  software  and  applications  for  logistic  regression 
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planning  models,  using  length  of  service  and  grade  as  carrier  variables.  It  is  seen  that 
the  length  of  service  scale  must  be  partitioned  into  segments  so  that  linear 
approximations  to  the  rate  process  are  tenable.  This  done,  the  direction  and  amount  of 
attrition  rate  change  can  be  approximated  and  interpretations  can  be  made  for  the 
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I.  INTRODUCTION 

A.  PURPOSE  AND  SUMMARY  OF  RESULTS 

The  purpose  of  this  study  is  to  develop  the  logistic  regression  alternative  for 
estimating  attrition  rates  using  length  of  service  and  grade  as  carrier  variables.  It  would 
be  most  useful  if  the  regression  coefficients  showed  temporal  stability  and  were  not 
highly  dependent  upon  the  occupational  specialty.  It  is  hoped  that  this  development 
can  enhance  previously  developed  understanding  of  the  attrition  process  as  it  affects 
the  United  States  Marine  Corps  officer  manpower  data. 

Unfortunately  the  logistic  regression  approach  to  this  problem  does  not  improve 
upon  estimators  developed  by  earlier  workers.  See  Table  8  on  page  30.  It  does, 
however,  contribute  to  the  understanding  of  the  attrition  process  as  it  relates  to  length 
of  service  and  grade.  The  partial  regression  coefficients  can  serve  in  ad  hoc  calculations 
to  indicate  the  direction  of  change  and  to  make  rough  estimates  of  the  amount  of 
change.  These  coefficients  do,  however,  change  in  more  than  small  ways  as  one  cha 
changes  the  military  occupational  specialty.  See  Table  7  on  page  24.  The  aviation 
community  especially  appears  to  possess  coefficients  quite  different  from  those  of  other 
communities. 

B.  BACKGROUND 

The  first  step  in  any  manpower  planning  should  be  a  good  description  of  the 
system  or  organization.  Such  can  allow  us  to  get  reasonable  forecast  values.  Forecasts 
should  never  be  interpreted  as  what  will  happen  but  as  central  estimates  of  what  could 
happen  if  the  assumed  trends  continue.  They  therefore  provide  a  guide  for  management 
action  required  to  achieve  a  desired  objective.  Also,  good  forecast  values  depend  upon 
finding  efficient  ways  to  estimate  attrition  rates.  In  other  words  the  description  of  the 
the  system,  attrition  rates  and  forecasting  are  each  dependent  on  one  another. 

The  forecasts  made  by  manpower  planning  models  are  affected  by  three  general 
factors;  existing  inventory,  projected  losses  and  projected  gains.  In  order  to  project  the 
inventory  into  various  future  time  periods  it  is  necessary  to  forecast  the  future  values 
using  a  realistic  system  of  flow  rates. 

Estimation  techniques  for  the  USMC  officer  attrition  rates  have  been  developed 
by  Major  D.D.Tucker  in  a  thesis  [Ref  1]  submitted  at  the  Naval  Postgraduate  School 


in  September  1985,  and  further  by  Major  John  R.  Robinson  in  a  thesis  [Ref.  2] 
submitted  at  the  Naval  Postgraduate  School  in  March  1986.  They  used  James-Stein 
and  other  shrinkage  type  parameter  estimator  schemes  for  the  purpose  of  generating 
stable  manpower  loss  rates.  The  reader  is  referred  to  Tucker  [Ref.  1]  and  Robinson 
[Ref.  2]  for  most  of  the  background  information  and  the  data  structure  used.  By 
necessity,  some  of  that  information  will  be  repeated  in  this  paper. 

The  United  States  Marine  Corps  has  about  20,000  officers.  These  can  be  cross 
classified  into  40  military  occupational  specialties  (MOS),  31  length  of  service  (LOS) 
cells  and  10  grades;  hence  12400  categories  for  manpower  planning  purposes.  Also 
about  half  of  these  categories  are  unoccupied  for  structural  reasons.  These  structural 
zero  categories  will  be  described  in  chapter  III.  The  officer  attrition  and  promotion 
structure  was  described  by  Tucker  [Ref.  1]. 

One  goal  of  this  paper  is  to  examine  whether  the  logistic  regression  model  is  an 
efficient  way  to  estimate  the  attrition  rates  (i.e.  the  rate  of  leaving  the  service,  not  of 
changes  in  MOS,  LOS  or  Grade)  for  the  officer  MOS  LOS,  Grade  categories.  This 
problem  is  difficult  because  of  the  large  number  of  cells  with  the  low  inventor}'.  Tucker 
[Ref.  1]  and  Robinson  [Ref.  2]  collected  the  cells  into  major  groups  or  aggregates  to 
treat  this  small  cell  problem;  attempts  were  made  to  aggregate  cells  that  were  believed 
to  have  common  statistical  behavior.  In  the  present  work  we  will  not  collect  the  cells 
into  major  groups.  Every  VI OS  will  be  taken  individually.  The  structural  zero  cells  will 
be  dropped  before  applying  the  fitting  procedure.  Namely,  structural  zero  cells  will  not 
be  included  in   the  regression  equations. 

There  are  seven  years  data  available  for  the  present  study.  The  first  four  years 
(from  1977  to  19S0)  will  be  used  for  model  development  and  logistic  regression  fitting; 
the  last  three  years  (from  1981  to  1983)  for  validation. 

C.       ORGANIZATION 

Chapter  II  contains  the  details  of  the  methodology  and  notation  used  in  the 
present  work.  A  brief  summary  of  the  generalized  linear  regression  model  is  presented 
in  this  chapter. 

Chapter  III  explains  the  logistic  regression  model  structure  for  the  Marine  Corps 
data  and  the  validation  procedure.  A  numerical  example  will  be  given  to  illustrate  the 
fitting  and  validation  procedures.  Also,  in  this  chapter  we  will  compare  Figures  of  merit 
with  Robinson's  [Ref.  2]  results. 

Chapter  IV  thoroughly  discusses  the  results  and  recommendations. 


Appendix  A  includes  the  APL  functions  for  the  data  manipulation,  the  logistic 
regression  and  the  validation  of  the  model. 

Appendix  B  illustrates  the  logistic  probability  plots  of  residuals  and  the  plots  of 
the  residuals  vs.  fitted  values  for  selected  cases. 
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II.  METHOD  OF  ESTIMATION 

A.  INTRODUCTION 

A  major  use  of  regression  models  is  prediction.  Thus,  given  data  on  a  response 
variable  y  and  associated  predictor  variables  x.  (i  =  1  to  p),  the  aim  of  the  regression  is 
to  find  a  function  of  the  x.'s  which  is,  in  some  sense  a  eood  predicator  of  v.  It  is 
assumed  throughout  that  the  x.'s  at  which  future  predictions  are  required  are  not 
specified  in  advance  but  will  occur  randomly  over  some  population  of  values  and  that 
the  success  of  prediction  can  be  judged  by  its  performance  over  such  a  population. 

Logistic  regression  is  a  member  of  the  class  of  generalized  linear  models.  An 
overview  of  the  linear  model  is  briefly  discussed  in  the  following  section.  All  of  the 
approach  and  background  for  the  logistic  regression  model  was  taken  from  Pregibon's 
[Ref.  3]  paper. 

B.  AN  OVERVIEW  OF  THE  LINEAR  REGRESSION  MODEL 

Linear  regression  is  used  to  relate  a  response  variable  y.  to  one  or  several 
explanatory  or  descriptive  variables  x..  through  a  set  of  linear  equations  of  the  form 


>'i  "  Po  +  J,Pjxij  +  ei  [  =   1'-'n 


1=1 

The  y.  (for  i  =  1  to  n)  are  the  n  observed  values  of  the  response  variable,  the  x..  (for  i 
=  1  to  n)  are  the  n  values  of  the  j  th  explanatory  variable  (for  j  =  1  to  p),  and  the 
parameters  p.  are  the  unknown  regression  coefficients.  The  e.  are  the  random  "errors" 
or  fluctuations.  The  variables  X:.  and  y.  are  sometimes  called  "independent"  and 
"dependent"  variables. 

The  linear  equation  above  can  be  simplified  by  defining  an  extra  variable  x-Q 
whose  value  is  always  1  (xJ0  =   1),  so  the  model  with  constant  term  can  be  written  as, 


,  =  J?  (3.x.. 

1    r=oHJ " 


+  8.  i  =   1 n 


Usually  the  e.  are  assumed  to  be  statistically  independent  of  each  other  with  zero 
means  and  with  a  constant  variance  that  does  not  depend  on  i  or  x... 

In  regression  we  usually  want  to  estimate  the  regression  coefficients  from  the 
data,  either  because  we  want  to  know  and  interpret  the  coefficients  themselves,  or 
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because  we  will  use  them  to  predict  future  values  of  y..  Upon  replacing  p.  by  their 
estimated  values  (3.,  we  obtain  the  fitted  (or  "predicted")  values  y., 


9.  =  YPp.x..  1=   l,...,n 


j  =  0 

The  residuals  8.  are  defined  as  the  differences  between  the  observed  and  the  fitted 
1 


values. 


£.  =  v.  -  v.  i  =   l,...,n 

1       - 1     *  1  '    ' 

The  residual  are  used  in  many  diagnostic  displays  because  they  contain  most  of 
the  information  regarding  lack  of  fit  of  the  model  to  the  data.  In  terms  of  fitted  and 
residuals,  we  have 

data  =  fit  +  residual 

which  in  mathematical  notation  is  expressed  as 

y.  =  ;     8.x..  +  e.  i  =  1 n 

In  matrix  notation  the  least-squares  estimate  P  can  be  found  as  follows, 

<p  =  z2  =  ||y-Xp||2  =  (y  -  XfaJ(  y  -  Xp) 

where  z  is  the  vector  of  residuals  ,  z    is  the  square  length  of  residuals  and  y  =  Xp  is 
the  vector  of  fitted  values.  When  we  do  some  algebra,  the  equation  becomes 

<p  =  yTy-  2yTXp  +  pTXTXp 

A  A. 

If  we  take  the  derivative  of  <p,  subject  to  p  and  set  the  d(p/<3p  equal  to  0,  then  the  least- 
squares  estimate  P  is  obtained  by  solving  this  normal  equation 

XTy  -  XTXP  =  0 

The  solution  of  the  linear  system  is 

V  =  (XJX)-lXTy 

which  is  sensitive  to  poorly  fit  observations  and  extreme  design  points. 

Presently,  there  is  a  fairly  large  battery  of  diagnotics  available  for  detecting  which 
observations  exert  undue  influence  on  p.  The  two  basic  quantities  that  are  most  useful 
for  this  purpose  are  the  residuals,  £■  =  y-  -  XjP,  and  the  projection  matrix 
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M  =  I  -  H  =  I-  X(XTX)_1XT 

where  H  is  called  hat  matrix.    Essentially,  the  vector  c  describes  the  deviation  of  the 
observed  data  from  the  fit.  and  M  the  subspace  in  which  £  lies 

As  a  bottom  line,  the  residual  vector  e  is  important  for  the  detection  of  ill-fitting 
points,  but  will  not  adequately  point  to  observations  which  unduly  influence  the  fit.  In 
particular,  large  residuals  are  seldom  associated  with  high-leverage  points,  whereas 
small  residuals  (which  usually  pass  our  inspection  unnoticed)  are  typically  of  the 
opposite  character. 

C.       BACKGROUND  AND  NOTATION  FOR  THE  LOGISTIC  REGRESSION 

1.  General 

A  maximum  likelihood  fit  of  a  regression  model  is  extremely  sensitive  to 
outlying  responses  and  extreme  points  in  the  design  space. 

Classically,  logistic  regression  models  were  fitted  to  data  obtained  under 
experimental  conditions,  for  example,  bioassay  and  related  dose-response  applications. 
The  current  use  of  logistic  regression  methods  includes  the  analysis  of  data  obtained  in 
observational  studies.  In  contrast  to  controlled  experimentation,  data  from  such 
studies  can  be  notoriously  "bad"  both  from  the  point  of  view  of  outlying  responses  (y), 
and  from  the  point  of  view  of  extreme  points  in  the  design  space  (X).  The  usual 
method  of  fitting  logistic  regression  models,  maximum  likelihood,  has  good  optimality 
properties  in  ideal  settings,  but  is  extremely  sensitive  to  "bad"  data  of  the  above  types. 

In  particular,  good  data  analysis  for  the  logistic  regression  models  need  not  be 
expensive  or  time  consuming. 

2.  Unstructured  case 

Consider  a  single  binomial  response  y  ~  B(n,p).  If  we  let  0  =  logit(p)  = 
log(p  (1  -  p)},  the  probability  function  of  y  can  be  written  as 

Ky;  9)  =  exp{y0  -  a(0)  +  b(y)}  y  =  0,1, ..,n 

with  a(9)  =  n  log(l  +  e  ),  b(y)  =  log  (C)  and  where  throughout  this  paper  log(.)  = 
loge(.).  Up  to  an  arbitrary  constant,  the  logarithm  of  f(y;  0), 

1(0;  y)  =  y0  -  a(0)  +  b(y) 

is  the  loglikelihood  function  of  0.  The  score  and  information  functions  are  given  by, 
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£1(9;  v) 
y    =  "TO =  y  -  a  (9)  =  y  -  np 

-<3s(0;  y) 
v(0;y)  -       q  a(0)  =  np(l-p) 

where  "a"  with  k  dots  above  it  denotes  (dk  /  50  )a{0).  Standard  results  yield  E(s(0;  y)} 
=  np  =  a(0)  and  Var(y)  =  np(l  -  p)  =  a(0).  Also,  since  s(0;  y)  =  0  at  the  maximum 
likelihood  estimate  (m.l.e)  of  0,  we  have  0  =  a  (y)  =  logit(y/n)  as  the  m.l.e.  of  0 
based  on  a  single  binomial  observation  y. 

Given  a  sample  of  N  independent  binomial  responses  y.  ~  B(n..p.).  The 
loglikelihood  function  for  the  sample  is  the  sum  of  individual  loglikelihood 
contributions: 

1(0;  y)  =  VNl(0i;  y.)  =  £  (y  0  -  a(0j)  +  b(y.)) 

i=l     *     l        i  =  l     l  1  l  i 

3.  The  logistic  regression  model 

The  likelihood  function  1(0;  y)  is  over-specified.  There  are  as  many  parameters 

as  observations.  Given  a  set  of  m  explanatory  variables  (XpX2, X   ),  the  logistic 

regression  model  utilizies  the  relationship 

0  =  logit(p)  =  XP 

as  the  description  of  the  systematic  component  of  the  response  y.  In  terms  of  the  m 
dimensional  paramater  P,  we  have  the  loglikelihood  function, 

1(X;  p)  =  fViP;  y,)  =  fj.  xfi  -  a(XjP)  +  b(yj) 

The  m.l.e.  maximizes  the  above  equation  and  is  a  solution  (assumed  unique)  to 
(<3,<3P)  l(Xp;  y)  =  0.    In  particular,  P  satisfies  the  system  of  equations: 

£Nxii(yra(xiP))=  0  j  =  l,..,m 

Writing  s  =  y  -  a(XP)  =  y  -  np,  the  formulation  of  the  likelihood  equations  is 
XTs  =  XT(y  -  f)  =  0 
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where  y  =  np  and  T  denotes  the  transpose.  These  equations,  although  very  similar  to 
their  normal  theory  counterparts,  are  nonlinear  in  P  and  iterative  methods  are  required 
to  solve  them.  Typically,  when  second  derivatives  are  easy  to  compute  (in  the 
-(d  c?(3)XTs  =  XTV  X  with  V  =  diagonal(a(  x-fj)}),  the  Newton-Raphson  method  is 
employed  .  This  leads  to  the  iterative  scheme 

Pt+1  =  Pr  +  (XTVX)_1XTs 

where  both  V  and  s  are  evaluated  at  p\  At  convergence  (t  =  u).  we  take  p  =  pu  ,  and 
denote  the  fitted  values  n.  p.  by  v..  The  estimated  variance  of  v.  is  v..  =  nn(l  -  p.). 

A  most  useful  way  to  view  the  iterative  process  outlined  above  is  by  the 
method  of  iteratively  reweighted  least-squares  (IRLS).  This  is  obtained  by  employing 
pseudo  observation  vector  z    =  Xp    +  V     ]s,  for  which  the  above  equation  becomes 

pt+l  =  (XTVX)~1XTVzt 

At  convergence,  we  have  z  =  Xp  +  V  ls.  Thus  we  may  write  the  maximum 
likelihood  estimator  of  P  as 

P  =  (XTVX)_1XTVz 

4.  Output  from  a  maximum  likelihood  fit 

Once  the  model  has  been  fitted  (that  is,  we  have  the  m.l.e.  P),  various 
quantities  from  the  fitting  process  are  available  for  the  data  analysis.  Typically,  these 
quantities  consist  of  subsets  of  the  following: 

1.  the  estimated  parameter  vector,  P  ; 

2.  the  individual  coefficient  standard  errors,  s.e.(Pj);; 

3.  the  estimated  covariance  matrix  of  p,  var(p)  =  (X    VX)        ; 

4.  the  chi-squared  goodness  of  fit  statistic  X         2-  si  /vii  ; 

5.  the  individual  components  of  x     namely  Xj  =  S["JW\\  =  (}'[  -  ^PiVV^PjO  "  Pj); 

^  *  /\ 

6.  the  deviance  D  =  -2{1(XP  ;  y)  -  1(0  ;  y)},  where  1(0  ;  y)  refers  to  the  maximum 
of  the  loglikelihood  function  based  on  fitting  each  point  exactlv,  i.e.,  9.    = 
logitty/rij). 

Asymptotic  arguments  suggest  that  the  deviance  and  chi-squared  statistics 

have  the  same  limiting  null  X  (N  -  m)  distribution,  and  hence  some  measure  of  the 

appropriateness  of  the  fitted  model. 
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D.       THE  BASIC  BUILDING  BLOCKS  OF  REGRESSION  DIAGNOSTICS 

1.  Preliminaries 

After  fitting  a  logistic  regression  model,  and  prior  to  drawing  inferences  from 
it,  the  natural  succeeding  step  is  that  of  critically  assessing  the  fit.  In  practice  however, 
this  assessment  is  rarely  considered  and  seldom  carried  out.  The  basic  reasons  are 

1.  the  lack  of  routine  methods  for  performing  such  an  analysis,  and 

2.  the  presumably  high  cost  of  doing  so. 

The  role  of  a  regression  diagnostician  is  to  provide  routine  methods  of  model 
sensitivity  analysis  which  are  both  intuitively  appealing  and  inexpensive.  Clearly  this 
requires  a  thorough  understanding  of  the  model  and  the  nature  of  the  fitting  process. 

2.  The  basic  building  blocks 

For  the  logistic  regression  model,  the  basic  building  blocks  for  the 
identification  of  outlying  influential  points  will  again  be  the  residual  vector  and  a 
projection  matrix.  For  the  linear  model,  residuals  are  rather  uniquely  defined  (apart 
from  standardization),  whereas  for  the  logistic  regression  model,  residuals  can  be 
defined  on  several  (at  least  three)  scales.  The  two  most  useful  are  the  components  of 
chi-square,  given  above  in  (e),  and  the  components  of  deviance,  D  =  Y  d- 

d1=   ±V2(l(ei;yi)-  1(  xxp  ;yi)}1/2, 

where  the  plus  or  minus  is  used  according  as  0-  >  xfi  or  9-  <  Xjp.  Note  that  dj  is 
defined  for  all  values  of  y-  even  though  0j  may  not  be.  In  particular,  y  =  0,  d2  =  -2n 
log(l-p)  and  at  y  =  n,  d2  =  -2n  log(p).  Both  X  and  D  are  the  measures  of  the 
goodness-of-fit  of  the  model. 

The  analog  of  the  projection  matrix  for  the  logistic  model  will  also  be  denoted 
by  M,  which  in  its  general  form  is  given  as 

M  =  I  -  H  =  I  -  V1/2X(XTVX)"1XTV1/2 

The  usefulness  of  M  arises  as  a  consequence  of  the  IRLS  formulation  described  earlier. 
In  particular,  as  P  =  (XTVX)-  ^"^Vz,   the  vector  of  pseudo-residuals  is  given  by 

z  -  XP  =  {I  -  X(XTVX)  "  lXJ\}z  =  V  "  i;2MVl/2z 

using  the  fact  that  z  =  Xp  +  V~  Js,  this  can  be  written  as  V_1s  =V_1  2  MV_1  2s 
Premultiplication  by  the  diagonal  matrix  V1  2  yields  X  =  MX)  where  x  =  V  1,2s 
Thus,  as  in  the  linear  model  case,  M  is  symmetric,  idempotent  and  spans  the  residual 
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(X)   space.     This   suggests   that    small   m..   which   are   the   diagonal   elements    of  the 
projection  matrix  M  should  he  useful  in  detecting  extreme  points  in  the  design  space. 

In  most  cases,  the  examination  of/.,  d.  and  m.  will  call  attention  to  outlying 
and  influential  points.  In  some  cases,  combinations  of  these  (for  example,  studcntized 
residuals)  will  also  be  useful.  For  displaying  these  quantities,  index  plots  are  generally 
(and,  if  the  order  of  the  observations  is  important  strongly)  suggested:  that  is,  plots  of 
y.  vs  i,  d.  vs  i  and  m..  vs  i.  In  particular  cases,  plots  of  these  building  blocks  against  the 
fitted  values  could  prove  useful. 
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III.  MODEL  BUILDING  WITH  USMC  MANPOWER  DATA 

A.       GENERAL 

Robinson  [Ref.  2]  explains  the  conversion  of  the  raw  data  to  an  APL  workspace.  A 
brief  explanation  about  the  conversion  is  given  in  Appendix  A.  The  summary  data  file 
classifies  the  Marine  Corps  officer  inventory  into  40  military  occupational  specialties, 
10  grade  levels,  31  length  of  service  and  8  loss  categories.  In  the  present  study  we  are 
not  dealing  with  the  type  of  loss.  These  were  described  by  Tucker  [Ref.  1]  For  use  in 
our  model  we  need  to  define  grades  and  military  occupational  specialties  (MOS)  by 
Table  1  and  2.  When  reference  is  made  to  a  particular  grade  or  group  of  grades  the 
code  number  from  Table  1  used  instead  of  the  name  of  the  grade.  For  example  this 
project  will  refer  to  the  grades  first  lieutenant,  captain  and  major  as  numbers  5,  6  and  7 
respectively.  Tucker  and  Robinson  used  data  code  numbers  for  the  MOS  instead  of 
the  actual  MOS.  For  example,  this  project  will  refer  to  the  Air  traffic  control  MOS  as 
number  37  not  73.  It  should  also  be  understood  that  the  two  digit  MOS  identifier  listed 
in  Table  2  is  strictly  the  military  occupational  specialty  identifier  in  the  US  VIC  MOS 
manual.  We  will  also  use  the  code  number  from  Table  2  for  the  MOS.  The  column 
containing  the  letters  A  through  E,  refer  to  the  structural  zero  categories. 


TABLE  1 

GRADES 

CODE 

GRADE 

0 

WARRANT  OFFICER  (W-l) 
CHIEF  WARRANT  OFFICER  1 

1 

[CWO-2) 

2 

CHIEF  WARRANT  OFFICER  1 

CWO-3 

3 

CHIEF  WARRANT  OFFICER  1 

CWO-4) 

4 

SECOND  LIEUTENANT 

5 

FIRST  LIEUTENANT 

6 

CAPTAIN 

7 

MAJOR 

8 

LIEUTENANT  COLONEL 

9 

COLONEL 

TABLE  2 

MILITARY 

OCCUPATIONAL  SPECIALTIES  (MOS) 

DATA 

CODE 

MOS 

CAT 

MOS  TITLE 

00 

UN 

A 

UNKNOWN 

01 

01 

A 

PERSONNEL  AND  ADMINISTRATION 

02 

02 

A 

INTELLIGENCE 

03 

03 

C 

INFANTARY 

04 

04 

A 

LOGISTICS 

05 

08 

A 

FIELD  ARTILLERY 

06 

11 

D 

UTILITIES 

07 

13 

A 

ENGINEER,  CONSTRUCTION  AND  EQUIPMENT 

08 

14 

D 

DRAFTING,  SURVEYING  AND  MAPING 

09 

15 

D 

PRINTING  AND  REPRODUCTION 

10 

18 

C 

TANK  AND  AMPHIBIAN  TRACTOR 

11 

21 

A 

ORDNANCE 

12 

23 

B 

AMMUNITION  AND  EXPLOSIVE  ORDNANCE 
DISPOSAL 

13 

25 

A 

OPERATIONAL  COMMUNICATIONS 

14 

26 

A 

SIGNALS  INTELLIGENCE/GROUND  ELECTRONIC 
WARFARE 

15 

28 

B 

DATA/COMMUNICATIONS  MAINTENANCE 
SUPPLY  ADMINISTRATION  AND  OPERATIONS 

16 

30 

A 

17 

31 

A 

TRANSPORTATION 

18 

33 

A 

FOOD  SERVICE 

19 

34 

A 

AUDITING, FINANCE  AND  ACCOUNTING 

20 

35 

A 

MOTOR  TRANSPORT 

21 

40 

A 

DATA  SYSTEMS 

22 

41 

B 

MARINE  CORPS  EXCHANGE 

23 

43 

A 

PUBLIC  AFFAIRS 

24 

44 

A 

LEGAL  SERVICES 

25 

46 

A 

TRAINING  AND  AUDIOVISUAL  SUPPORT 

26 

55 

B 

BAND 

27 

57 

D 

NUCLEAR, BIOLOGICAL  AND  CHEMICAL 

28 

58 

A 

MILITARY  POLICE  AND  CORRECTIONS 

29 

59 

B 

ELECTRONICS  MAINTENANCE 

30 

60 

A 

60  XX 

31 

61 

A 

AIRCRAFT  MAINTENANCE 

32 

63 

B 

AVIONICS 

33 

65 

B 

AVIATION  ORDNANCE 

34 

68 

B 

WEATHER  SERVICE 

35 

70 

D 

AIRFIELD  SERVICES 

36 

72 

A 

AIR  CONTROL,  AIR  SUPPORT  AND  ANTI-AIR 
WARFARE 

37 

73 

A 

AIR  TRAFFIC  CONTROL 

38 

75 

C 

PILOTS  AND  NAVAL  FLIGHT  OFFICERS 

39 

99 

E 

IDENTIFYING  MOS  AND  REPORTING  MOS 

A  structural  zero  is  a  cell  whose  inventory  is  always  zero  because  certain  grades 
and  length  of  service  combinations  should  never  appear  in  that  military  occupational 
specialty  (MOS).  For  example  a  Colonel  with  5  years  of  service  in  any  MOS  or  an 
inventory  warrant  officer  in  MOS  03  does  not  exist.  The  effect  of  these  structural  zero 
categories  is  summarized  in  Table  3. 
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TABLE  3 

STRUCTURAL  ZEROES  CATEGORIES 

Category 

Grades 

within  MOS 

Number 
of  MOS 

Stru. 
Zeroes 
per  MOS 

Totol 
Zeroes 
per  Cat. 

A 
B 
C 
D 

E 

WOl. . . LTCOL       23 
WOl. . . CW04,LDO     8 
2LT. . . LTCOL        3 
WOl. . . CW04         5 
WOl. . . COL          1 

129 
159 
202 
237 
119 

2967 
1272 

606 
1185 

119 

TOTAL 

40 

6149 

B.       HOW  TO  BUILD  THE  LOGISTIC   REGRESSION  MODEL  WITH  USMC 
DATA 

1.  Introduction 

The  purpose  of  this  study  is  to  develop  the  logistic  regression  model  for 
estimating  USMC  officer  attrition  rates  using  length  of  service  (LOS)  and  grade  (GR) 
as  carrier  variables.  The  logistic  regression  model  for  the  estimation  of  USMC  officer 
attrition  rates  can  be  formulated 

0  =  logit(p)  =  Pj  +  P2(LOS)  +  P3(GR) 

In  matrix  notation,  this  can  be  written  as 

0  =  xp 

where  X  is  Nxm  matrix,  also  called  the  design  space  and  P  is  the  mxl  matrix,  also 
called  the  coefficients  of  the  regression.  Then,  it  can  be  said  that  0  =  logit(p)  is  a  Nxl 
matrix. 

2.  How  to  create  the  design  space 

Each  MOS  is  taken  individualy  for  the  estimation  of  officer  attrition  rates. 
Every  MOS  has  dimension  31x10  for  31  LOS's  and  10  grades.  Each  LOS  and  grade 
must  be  broken  into  segments  and  each  segment  is  a  seperate  regression.  As  an 
example,  any  MOS  can  be  broken  into  four  segments  as  in  Table  4.  Each  segment  has 
its  own  X  matrix.  Each  design  space  (X)  has  dimension  Nxm  where  N  stand  for  the 
number  of  independent  binomial  responses  and  m  stand  for  the  number  of  explantory 
variables,  which  is  always  three  in  our  case.  This  X  matrix  can  be  written 
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CM     LOS     GR 


X 


x 


11 


v21 


"12 

"22 


X 


13 


X23 


(Nx3) 

xNl       XN2      XN3 

where  CXT  means  constant  which  is  the  first  column  of  the  X  and  always  one. 


LOS 

18<LOS<30 
8<LOS<19 
4<LOS<10 
0<LOS<  5 


TABLE  4 
SEGMENTS 


GRADE 


8,9    (LTCOL,    COL) 

5,6,7,8    (FIRST    LT,    CAPT,    MAJ, LTCOL) 
4,5,6    (SECOND    LT,     FIRST    LT,CPT) 
1,2,3,4    (CWO-2,CWO-3,CWO-4, SECOND    LT) 


C.       A  NUMERICAL  EXAMPLE  FROM  THE  USMC  DATA 

As  an  illustration  of  the  standard  output  from  a  maximum  likelihood  fit  and  the 
use  of  the  logistic  regression  model,  we  will  use  the  case  where  military  occupational 
specialty  (MOS)  =  20  (motor  transport,  from  Table  2),  length  of  service  (LOS)  = 
from  5  to  19  years  and  grades  =  4,5,6,7  (second  lieutenant,  first  lieutenant,  captain 
and  major,  from  Table  1).  The  data  are  listed  in  Table  5.  They  are  obtained  using  the 
APL  data  manipulation  functions  described  in  detail  in  Appendix  A. 

In  Table  5,  the  structural  zero  inventory  cells  are  dropped  before  applying  the 
fitting  procedure.  The  output  listed  in  Table  6,  is  obtained  using  the  APL  logistic 
regression  functions  in  Appendix  A.  We  get  the  estimated  coefficients  of  regression  as 
follows, 

Pj  =  0.548539 

P2  =  -0.17092 

J33  =  -0.20117 
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TABLE  5 

DATA 

CENTRAL 

X 

LOSS 

INVENTORY 

CNT 

LOS 

GR 

*i 

ni 

1 

5 

4 

0 

4.  5 

1 

5 

5 

6 

33.  5 

1 

5 

6 

1 

3 

1 

6 

4 

0 

2.  5 

1 

6 

5 

5 

19 

1 

6 

6 

0 

9 

1 

7 

4 

0 

1.  5 

1 

7 

5 

1 

5.  5 

1 

7 

6 

2 

13 

1 

8 

5 

3 

3 

1 

8 

6 

1 

14 

1 

9 

4 

0 

1 

1 

9 

5 

3 

4.  5 

1 

9 

6 

2 

12.  5 

1 

10 

4 

0 

1 

1 

10 

5 

0 

3.  5 

1 

10 

6 

0 

12 

1 

10 

7 

0 

0.  5 

1 

11 

4 

0 

0.  5 

1 

11 

5 

1 

7 

1 

11 

6 

0 

5 

1 

11 

7 

0 

3 

1 

12 

5 

0 

7 

1 

12 

6 

1 

5 

1 

12 

7 

0 

4 

1 

13 

5 

0 

10.  5 

1 

13 

6 

0 

4.  5 

1 

13 

7 

0 

3 

1 

14 

5 

0 

10 

1 

14 

6 

1 

7 

1 

14 

7 

0 

4 

The  deviance  for  the  fit,  46.5863  on  28  degrees  of  freedom,  and  the  corresponding 
chi-squared  statistic  is  46.4579.  Both  are  less  than  their  asymptotic  expectation  of  28, 
indicating  no  gross  inadequacies  with  the  model.  In  table  6,  x  is  the  individual 
component  of  x  ,  d.  is  the  component  of  deviance  and  m,.  is  the  diagonal  element  of  of 
projection  matrix  M.  The  examination  of  X->  d.  and  m.  calls  attention  to  outlying  and 
influental  points.  The  individual  components  of/2  and  of  the  deviance  (d.)  are  plotted 
against  the  logistic  probability  plot  in  Figure  3.1.  Evidently,  two  observations,  the  lCr1 
and  13th  are  not  well  fit  by  the  model;  their  x  and  deviance  (residuals)  deviate  from  the 
straight  line  configuration  of  the  others.  Also,  fitted  values  are  plotted  against  the 


22 


TABLE  6 

OUTPUT 

1 

ogit(yi/ni 

) 

e. 

i 

Xi 

d. 
i 

nu- 
ll 

1 

_ 

-1. 

1108 

-1. 

2172 

1. 6005 

0.  8124 

2 

-1. 

5224 

-1. 

3120 

-0. 

4678 

-0. 6487 

0. 5298 

3 

-0. 

6931 

-1. 

5131 

0. 

6884 

1.  2806 

0. 9213 

4 

- 

-1. 

2817 

-0. 

8329 

1. 1066 

0. 9054 

5 

-1. 

0296 

-1. 

4829 

0. 

8775 

0.  9521 

0. 8210 

6 

- 

-1. 

6841 

-1. 

2924 

1.  7506 

0. 8388 

7 

- 

-1. 

4526 

-0. 

5923 

0. 7941 

0. 9459 

8 

-1. 

5040 

-1. 

6538 

0. 

1356 

0. 5473 

0. 9595 

9 

-1. 

7047 

-1. 

8540 

0. 

1957 

0. 5482 

0. 8368 

10 

- 

-1. 

8247 

4. 

3133 

3. 4417 

0. 9784 

11 

-2. 

5649 

-2. 

0259 

-0. 

5256 

-1. 0382 

0. 8666 

12 

- 

-1. 

7945 

-0. 

4076 

0. 5545 

0.  9633 

13 

-0. 

6931 

-1. 

9957 

3. 

5753 

2. 3189 

0. 9625 

14 

-1. 

6582 

-2. 

1969 

0. 

7066 

1. 0379 

0. 8973 

15 

- 

-1. 

9654 

-0. 

3742 

0.  5120 

0.  9620 

16 

- 

-2. 

1666 

-0. 

6332 

0. 8713 

0. 9640 

17 

- 

-2. 

3678 

-1. 

0602 

1.  4660 

0. 9027 

18 

- 

-2. 

5690 

-0. 

1957 

0. 2716 

0. 9891 

19 

- 

-2. 

1364 

-0. 

2429 

0.  3340 

0. 9802 

20 

-1. 

7917 

-2. 

3375 

0. 

5116 

1. 0448 

0.  9114 

21 

- 

-2. 

5387 

-0. 

6283 

0. 8717 

0. 9561 

22 

- 

-2. 

7399 

-0. 

4401 

0. 6127 

0. 9429 

23 

- 

-2. 

5085 

-0. 

7548 

1.  0466 

0. 8942 

24 

-1. 

3862 

-2. 

7096 

1. 

2719 

1.  6268 

0. 9507 

25 

- 

-2. 

9108 

-0. 

4665 

0. 6511 

0. 9309 

26 

- 

-2. 

6794 

-0. 

8487 

1.  1804 

0. 8171 

27 

- 

-2. 

8806 

-0. 

5024 

0. 7008 

0. 9499 

28 

- 

-3. 

0817 

-0. 

3709 

0. 5187 

0. 9515 

29 

- 

-2. 

8503 

-0. 

7604 

1.  0603 

0. 8054 

30 

-1. 

7917 

-3. 

0515 

1. 

2450 

1.  5873 

0. 9133 

31 

-3. 

2527 

-0. 

3932 

0. 5509 

0.  9381 

components  of  the  deviance  and  the  components  of  the  x  'n  Figure  3.2.  For 
displaying  the  combinations  of  y.,  d.  and  m..,  index  plots  (i.e.  Y.  vs  i,  d.  vs  i  and  m-  vs 
i)  are  showed  in  Figure  3.3. 

Also,  we  selected  some  cases  to  examine  whether  the  coefficients  of  regression 
have  temporal  stability  or  not.  The  estimated  coefficients  of  regression  are  listed  by 
Table  7  for  the  selected  cases. 
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TABLE  7 

COEFFICIENTS  OF  REGRESSION  FOR  SOME  CASES 

MOS  = 

3  ( INFANTRY) 

Pi 

h 

P3 

0<LOS<  6 

3<L0S<  9 

9<L0S<  19 

19<LOS<29 

AND  4<GR<6 
AND  4  <  GR  <  6 
AND  5<GR<8 
AND  7<GR<9 

-5. 786 

-2. 029 

4.  714 

-1.  376 

0.  037 

-0. 212 

0.  047 

0.  191 

0.  764 

0.  245 

-1.  389 

-0. 609 

MOS  = 

7  (ENGINEER, 

CONSTRUCTION 

AND  EQUIPMENT) 

h 

P2 

P3 

0<LOS<  6 

3<LOS<  9 

9<LOS<  19 

19  <  LOS  <  29 

AND  4<GR<6 
AND  4  <  GR  <  6 
AND  5<GR<8 
AND  7<GR<9 

-5. 900 

-1.  758 

3.  846 

0.  021 

0.  037 
-0. 129 
-0. 160 

0.  150 

0.  827 

0.  129 

-0. 845 

-0. 639 

MOS  = 

13  (OPERATIONAL  COMMUNICATION) 

Pi 

P2 

h 

O^LOS<  6 

3<LOS<  9 

9  <  LOS  <  19 

19<LOS<29 

AND  4<GR<6 
AND  4<GR<6 
AND  5<GR<8 
AND  7<GR<9 

-5. 995 

-1. 188 

3.  366 

-0. 783 

0.  038 
-0. 186 
-0. 117 

0.  178 

0.  884 
0.  281 
-0. 776 
-0. 614. 

MOS  = 

20  (MOTOR  TRANSPORT) 

Pi 

P2 

p3 

0<LOS<  6 

3<LOS<  9 

9  <  LOS  <  19 

19  <  LOS  <  29 

AND  4<GR<6 
AND  4  <  GR  <  6 
AND  5<GR<8 
AND  7<GR<9 

-7. 406 

-4.  438 

1.  866 

-0. 440 

-0. 089 

-0. 066 

-0.  315 

0.  009 

1.  249 

0.  646 

-0. 135 

-0. 101 

MOS  = 

38  (PILOTS  AND  NAVAL  FLIGHT  OFFICERS) 

Pi 

P2 

P3 

0  <  LOS  <  6 

3<LOS<  9 

9  <  LOS  <  19 

19<LOS<29 

AND  4<GR<6 
AND  4<GR<6 
AND  5<GR<8 
AND  7<GR<9 

-10. 1922 

-10. 8841 

2. 1225 

0.  3936 

-0. 0404 

-0. 1476 

-0. 1663 

0. 2257 

1.  5493 

1.  7560 

-0. 4317 

-0. 8984 
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D.       VALIDATION  OF  MODEL 

A    validation    test    was    conducted    to    evaluate    the    efficiency    of  the    logistic 

regression  model  for  the  estimation  of  the  L'SMC  officer  attrition  rates.  The  test  was 

conducted  as  follows: 

1.     Select   the   LOS's   and   erades  within   a   military   occupational   specialty.   The 
resulting  desired  array  will  be  three  dimensional  (years,  LOS,  grades) 

Let  "i"  stand  for  LOS.  then  i  =  0,...,30 

Let  ")"  stand  for  GR,  then  j  =  0,...,9 

Let  v..  =  number  of  leavers  in  cell  (i.i) 
-ij  J 

Let  n..  =  central  inventory  in  (i.j)  =  max  ((N(t)  +  N(t+  1)),2,  Y(t  )} 

Let  t  =   1 T  where  T  =  number  of  vears  (i.e  from  1977  to  1983)  of  data  used 

to  create  the  estimator 


The  validation  procedure  used  t  =  1,...,4  (i.e.  from  1977  to  1980)  for  the  fitting 
and  t  =  5.6,7  (i.e.  from    1981  to  1983)  for  validation. 

The  following  procedures  were  utilized  to  validate  the  effectiveness  of  the  logistic 
regression  estimation  process.  We  define  an  indicator  variable 

1        p..  =  0  or  1 
D..  =        if 


0        p..  *  0  or  1 


Then 


K  =  V V  Dr  for  all  i  and  j 

where  K  is  the  number  of  nonstructural  zeroes  cells.  Then  validation  test  can  be 
formulated  as  chi-square  goodness  of  statistic  test  as  follows 

(Pij  -  Pij)2 

Chi-square  MOE  =  £Y  D.. n..  for  all  i  and  j 

Where  p..  is  found  from  the  fitting  using  the  estimator  years,  p..  ( =  y/n)  can  be 
obtained  from  the  validation  and  the  central  inventory  which  comes  from  the 
validation  years.  For  our  numerical  example,  (MOS  =  3,  LOS  =  5  through  14  and 
GR  =  4,5,6,7)  we  get  the  following  validation  test  results  for  the  years  1981,  19S2  and 
1983  specifically  MOE;  are  52.6998,  36.4182  and  30.6585  respectively. 
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Figure  3.1     Probability  plots  of  X-  and  d. 
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Figure  3.2     Plots  of  fitted  values  vs  /•  and  d.. 
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E.       COMPARISON  OF  THE  FIGURES  OF  MERIT 

In  this  section,  we  will  compare  the  figures  of  merit  with  Major  Robinson's 
[Ref  2]  results.  As  we  mentioned  before,  he  used  the  limited  translation  shrinkage 
estimation  (LTSE)  for  the  estimation  of  USMC  officer  attrition  rates.  We  have  been 
using  a  different  estimation  method  for  the  same  manpower  data.  Also,  he  used 
procedure  which  we  explained  in  the  above  section  to  validate  the  effectiveness  of  the 
limited  translation  shrinkage  estimaton.  In  order  to  compare  the  figures  of  merit  of 
logistic  regression  and  the  shrinkage  estimation,  we  present  some  results  for  some  cases 
in  Tables  8  and  9. 

If  we  look  at  the  tables  we  can  see  that  shrinkage  estimation  looks  better  than 
logistic  regression  estimation  for  most  of  the  selected  cases.  We  can't  say  that  limited 
translation  shrinkage  estimation  is  much  better  than  logistic  regression.  The  results  are 
very  close  to  each  other  for  some  cases,  even  though,  logistic  regression  is  sometimes 
better  than  shrinkage  estimation  (i.e.  for  case  MOS  =  20,  3  <  LOS  <  9  and 
4<GR<6). 
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TABLE  8 

FIGURES  OF  MERIT 

( 0  <  LOS  <  6 )  AND  ( 4 

<GR 

<6) 

1981 
MOS  =  3  ( INFANTRY) 

1982 

1983 

LTSE                   27.8528 
REGRESSION             59.8577 

42. 4799 
88. 5361 

45. 9140 
86. 6193 

MOS  =  7  (ENGINEER,  CONSTRUCTION 

AND 

EQUIPMENT) 

LTSE                   13.2892 
REGRESSION             35.3195 

18. 8664 
31.  3636 

20. 7735 
27. 6810 

MOS  =  13  (OPERATIONAL  COMMUNICATIONS) 

LTSE                   22.4989 
REGRESSION             41.  7272 

16. 1496 
31.  5084 

13. 5038 
30. 6847 

MOS  =20  (MOTOR  TRANSPORT) 

LTSE                   15.9591 
REGRESSION             24.4329 

34. 4740 
28. 3449 

17. 8570 
22. 5246 

(  3  <  LOS  <  9  )  AND  (  4 

<GR 

<6) 

1981 

MOS  =  3  ( INFANTRY) 

1982 

1983 

LTSE                   19.  1602 
REGRESSION             73.0644 

67. 2562 
89. 0204 

34.  1118 
61. 9981 

MOS  =  7  (ENGINEER,  CONSTRUCTION 

AND 

EQUIPMENT) 

LTSE                   20.5515 
REGRESSION             60.5127 

19. 8988 
40. 1607 

18. 2333 

26. 2687 

MOS  =  13  (OPERATIONAL  COMMUNICATIONS) 

LTSE                   20.3665 
REGRESSION             28.6348 

15. 3913 
25. 9982 

17. 6670 
32. 2280 

MOS  =20  (MOTOR  TRANSPORT) 

LTSE                   22.3545 
REGRESSION             26.  1725 

52. 2840 
31. 6402 

35. 5580 
19. 7830 
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TABLE  8 
FIGURES  OF  MERIT  (CONT'D.) 

(9<LOS<19)    AND    (5<GR<8) 

1981  1982  1983 

MOS    =    3    ( INFANTRY) 

LTSE  84.5388  70.3422  40.2220 

REGRESSION  149.5783  61.7802  41.9882 

MOS    =    7    (ENGINEER,    CONSTRUCTION   AND    EQUIPMENT) 

LTSE  42.4237  22.9296  17.3584 

REGRESSION  84.4140  48.6112  24.7120 

MOS    =    13    (OPERATIONAL   COMMUNICATIONS) 

LTSE  48.3150  25.9520  26.6658 

REGRESSION  108.1312  41.2197  37.5635 

MOS    =20    (MOTOR   TRANSPORT) 

LTSE  20.5629  24.6164  16.2029 

REGRESSION  41.8773  44.0796  33.7604 

(19<LOS<29)  AND  (7<GR<9) 

1981  1982         1983 

MOS  =  3  ( INFANTRY) 

LTSE  30.0620       18.9604      29.1716 

REGRESSION  46.3861       28.9819      32.3470 

MOS  =  7  (ENGINEER,  CONSTRUCTION  AND  EQUIPMENT) 

LTSE  21.8423       25.2194      34.9758 

REGRESSION  28.3865       33.0140      35.8610 

MOS  =  13  (OPERATIONAL  COMMUNICATIONS) 

LTSE  46.9617       20.6439      10.8807 

REGRESSION  77.5956       36.2923      21.5748 

MOS  =20  (MOTOR  TRANSPORT) 

LTSE  12.5150       15.5716      12.9169 

REGRESSION  23.2035       27.9930      31.8230 
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IV.  CONCLUSIONS  AND  RECOMMENDATIONS 

A.  CONCLUSIONS 

Recall  that  the  logistic  function  and  its  inverse  can  be  expressed  as 

9  =  In  [p/(l-p)>    and   p  =  e    /  (1  +  e  ) 

Further,  it  is  useful  to  record  , 

dp.'dG  =  e    /  (1  +  e  )2 

Identifying  p  as  the  attrition  rate,  we  can  use  a  limited  Taylor  approximate  the  change 
in  rates.  Thus, 

Ap  =  p(l  -p)(p2ALOS  4-  p\AGR} 

provides  us  with  a  linear  approximation  to  the  direction  and  amount  of  change. 

Although  the  logistic  regression  approach  does  not  improve  upon  the  attrition 
rate  estimators  developed  by  Tucker  [Ref.  1]  and  Robinson  [Ref.  2]  it  does  point  to  the 
direction  of  change  as  one  varies  LOS  and  GR.  To  this  end,  it  was  necessary  to 
partition  the  30  year  LOS  range  into  segments.  It  is  an  exercise  in  curiosity  to 
speculate  as  to  the  reasons  for  observed  behavior  in  these  segments.  Here  is  our 
offering 

1.  0  <  LOS  <  5;  attrition  rates  are  chaotic  as  young  officers  "test  the  waters". 

2.  3  <  LOS  <  9;  attrition  rates  decline  with  increasing  LOS  as  officers  commit 
themselves  to  lonser  second  and  third  contracts.  One  would  think  that 
advancement  in  grade  would  also  correlate  with  a  lower  rate,  but  we  don't  see 
that  in  Table  8  also  there  are  other  kinds  of  shifts  influencing  the  attrition 
behavior  in  these  years. 

3.  9  <  LOS  <  19;  the  maturina  carrier  commitment  has  been  made  and  rates 
decline  with  increasing  LOS  and  GR. 

4.  19  <  LOS  <  30  ;  since  advancement  opportunities  of  the  senior  officer  are 
quite  limited  we  see  rates  increasing  with  LOS  and  decreasing  with  advances  in 

B.  RECOMMENDATIONS 

The  linear  approximation  to  the  effect  of  change  could  be  most  useful  if  we  could 
group  the  MOS  categories  into  sets  of  common  regression  coefficients  and  if  these 
coefficients  were  stable  over  time.    To  pursue  each  of  these  contingencies  requires 
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additional  work  and  an  expanded  data  base.  The  programs  developed  in  this  thesis 
serve  as  a  foundation  for  extension. 
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APPENDIX  A 
APL  FUNCTIONS 

1.  GENERAL 

This  appendix  contains  APL  functions  for  the  data  manipulation,  logistic 
regression  and  the  validation  of  the  model.  The  original  data  is  on  a  magnetic  tape 
named  COUNTS  prepared  by  Navy  Personel  Resarch  and  Development  Center 
(NPRDC).  Robinson  [Ref.  2]  explained  the  conversion  of  raw  data  from  tape  to  an 
APL  workspace.  In  order  to  get  the  LOSSXX  (Losses)  and  INVXX  (Inventories) 
arrays,  the  procedure  should  be  followed  in  the  order  presented  by  Robinson.  "XX"  is 
the  applicable  fiscal  year.  (e.g.  77  for  fiscal  year  1977) 

2.  DATA  MANIPULATION  FUNCTIONS 

Some  APL  functions  were  developed  by  Tucker  and  Robinson  for  the  data 
manipulation  and  exucution  of  calculations  pertaining  to  the  processes  under 
evaluation.  These  functions  will  be  summarized  in  the  following  section.  We  will  use 
some  of  them  in  this  project.  They  are  GETINV,  INVMATX,  GETLOSS  and 
MATRIX.  Also,  two  more  APL  functions  were  utilized  for  the  manipulation  of  the 
data  in  order  to  use  the  logistic  regression  and  validation. 

a.  Creating  the  inventory  and  loss  arrays 

Using  the  INVXX  arrays  and  the  APL  function  GETINV  in  Figure  A.l  and 
INVMATX  in  Figure  A.2  create  the  array  IXX.  Note  that  GETINV  calls  INVMATX 
and  INVMATX  uses  the  INVXX  arrays.  APL  workspace  size  limitations  may  be  a 
problem  due  to  the  large  amount  of  data.  It  may  be  necessary  to  create  one  or  two 
arrays  at  one  time  and  copy  them  to  another  workspace. 

The  LXX  arrays  are  created  in  a  manner  similar  to  the  above,  using  the  APL 
functions  GETLOSS  in  Figure  A. 3  and  MATRIX  in  Figure  A. 4  APL  function 
MATRIX  uses  the  loss  arrays  LOSSXX.  The  resulting  matrices  are  "IXX"  and  "LXX" 
for  fiscal  year  "XX".  The  function  "INVMATX"  and  "MATRIX"  could  create  a  matrix 
of  the  following  dimension  7x40x10x31  for  7  years,  40  MOS's,  10  grades  and  31  LOS's. 
However,  due  to  limited  workspace,  the  dimension  of  40x31x10  for  40  MOS's  31  LOS's 
and  10  Grades  was  commonly  utilized. 
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V  GETINV 
[  1  ]  p  THIS  FUNCTION  CALLS  THE  FUNCTION  INVMATX 
[  2  ]  p  FOR  EACH  FISCAL  YEAR .  IXX  IS  THE  INVENTORY 
[3]  r  ARRAY  FOR  FISCAL  YEAR  XX  BY  OF/LOS/GRADE. 
[4]  I77+INVMATXINV77 
[5]  I78+INVMATX INV7  8 
[6]  I79+INVMATX INV79 
[7]  I8Q+INVMATX INV80 
[8]  I81+INVMATX INV81 
[9]  I82+INVMATX INV82 
[10]  I83+INVMATX  INV83 
[11]  •  SHAPE  OF  177  IS   ' 
[12]  D^pI77 

7 


Figure  A.l     APL  Function  GETINV. 


V  Z+INVMATX  X:A:B:C:D:E:F:I:J 

11] 

p  CREATES  THE  INVENTORY  ARRAYS  FOR  THE  FISCAL 

12: 

p  YEARS  USING  THE  ARRAYS  OF  INDEXES  INVXX . 

:a: 

fl  INVXX  MUST  BE  A  CHARACTER  VECTOR  OF  9  DATA 

"L+" 

p  ENTRIES  FOLLOWED  BY  1  BLANK  FOR  EACH  LOOP. 

is; 

Z*(40  31  10)p0 

16; 

7>pX 

17: 

LOOP:^(j-0)/OUT 

181 

19; 

p  A«-a(l+X) 

110 

]  B«-l  +  (a(2+X«-(l+X))) 

111 

C«-l+(a(l*X«-(2*XJ } ) 

:i2 

Z?-e-l  +  U(2+X"e-(l+X;55 

[13 

]  E*<l>(3*X«-(2+X)) 

:i4 

:i5 

Z[B;Z?:C]*£ 
]  X<-(4+X) 

:i6 

t/^J-1 

:i7 

]  +LOOP 

:i8 

]  0£/7/:  '  FINISHED  --  SHAPE  OF  MATRIX  IS   ■ 

[19 

]pZ 

Figure  A.2    APL  Function  INVMATX. 

b.  Manipulation  of  the  data  for  regression  and  validation 

The  function  GETCENTNV  in  Figure  A. 5  creates  the  central  inventory  which 
assigned  CIXX  for  the  fiscal  years  from  1977  to  1983.  The  function  GETCENTNV  uses 
the  global  variables  of  "IXX"  and  "LXX"  for  the  inventory  and  loss  matrices 
respectively,  for  fiscal  year  "XX". 
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V  GETLOSS 

[i: 

q  THIS  FUNCTION  CALLS  MATRIX  FOR  EACH  FISCAL 

[2] 

0  YEAR .  LXX  IS  THE  LOSS  ARRAY  FOR  FISCAL  YEAR 

[3: 

p XX  BY  OF/LOS/GRADE. 

[4] 

Lll '^MATRIX  L0SS7  7 

[5] 

L7  8+MATRIX  LOSS! 8 

[6] 

L79+MATRIXLOSS79 

C7! 

L80+MATRIXLOSS80 

[8; 

fl  L81+MATRIX  L0SS81 

[9] 

R  L82+MATRIX  L0SS82 

C1C 

>]  p  L83+MATRIXLOSS83 
V 

Figure  A.3  APL  Function  GETLOSS. 


V  Z+MATRIXX;A;B;C;D;E:F:I;J 
[  1  ]  r  THIS  FUNCTION  CREATES  THE  LOSS  ARRAY  FOR  THE  FISCAL 
[  2  ]  R  YEARS  USING  THE  ARRAY  OF  LOSS  INDICES  LOSSXX .  IT  IS 
[  3  ]  r  CALLED  BY  GETLOSS .  LOSSXX  MUST  BE  A  CHARACTER  VECTOR 
[4]  p  WITH  9  DATA  ENTRIES  FOLLOWED  BY  1  BLANK  FOR  EACH  LOOP. 
[5]  Z^(40  31  10)p0 
[6]  I+pX 
[7]  J<-(I+1U10 
[8]  LOOP:^(j-Q)/OUT 
[9]  A^<t>(l*X) 
[10]  B^l  +  (<l>(2+X<-(H<X 
[11]  C«-l  +  (a(l+X«-(2+X 
[12]  C<-1 +  (<!>( 2 +X«-(1*X 
[13]  £><!>(  1 +X^ ( 2 *X)) 
[14]  F*<I>(2+X<-(1*X)) 
[15]  ZZB',D;C1+ZIB,D;C1+F 
[16]  X«-(3+X) 
[17]  J+J-l 
[18]  +LOOP 

[19]  Of/27 :  '  FINISHED  -  -  SHAPE  OF  MATRIX  IS « 
[20]  pZ 

V 


Figure  A.4    APL  Function  MATRIX. 

The  function  GETDATA  in  Figure  A. 6  manipulates  the  data  for  regression 
and  validation  procudures.  The  outputs;  IEST  and  LEST  are  the  sum  of  CIXX  and 
LXX  respectively  where  "XX"  is  the  fiscal  years  1977  to  1980,  i.e.  the  first  4  years  are 
used  for  the  estimation.  "IVALXX"  and  "LVALXX"  are  the  CIXX  and  LXX 
respectively  where  "XX"  here  is  the  fiscal  years  from  1981  to  1983,  i.e.  the  last  three 
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V  GETCENINV 
[  1  ]  p  GET  THE  CENNTRAL  INVENYORY  DATA  FOR 
[2]  p  THE  FISCAL  YEAR  FROM  19  77  TO  1983 
[3]  CI77«-((I77+I78H2)rL77 
[4]  CT78«-(  (178+179  )i2)[L78 
[5]  CJ79^( (I79+I80K2 )T£79 
[6]  CT8  0«-(  (18  0+181  J*2  }rL80 
[7]  CI81+1 (I81+J82)*2)rL81 
[8]  CI82«-(I82+I83  )*2)f£82 
[9]  CJ83^I83rL83 

V 


Figure  A.5  APL  Function  GETCENINV. 


V  GETDATA 
CI]  p  MANIPULATE  THE  DATA  TO  USE  IN  REGGRESSION 
[2]  p  AND  VALIDATION  PROCUDURES 
[3]  IEST+CI77+CI7  8+CI7  9+CI80 
[4]  LEST<-L77+L78+L79+L80 
[5]  IVAL81+CI81 
[6]  IVAL82+CI82 
[7]  IVAL83+CI83 
[8]  LVAL81+L81 
[9]  LVAL82+L82 
[10]  L7AL83^-L83 

V 


Figure  A.6    APL  Function  GETDATA. 

years  are  used  for  the  validation  procudure.  The  function  GETDATA  uses  the  global 
variables  CIXX  and  LXX  for  the  central  inventory  matrix  and  loss  matrix  for  fiscal 
year  "XX". 

c.  Why  the  central  inventory? 

A  problem  arises  on  several  occasions  when  the  data  is  disaggregated  to  a 
level  for  which  the  inventory  is  very  small.  For  example,  when  examining  the  inventory 
in  a  particular  fiscal  year,  the  inventory  can  be  zero  for  a  length  of  service  (LOS)  and 
military  occupational  specialty  (MOS)  combination.  Examining  the  inventory  in  the 
next  fiscal  year  for  the  same  LOS  and  MOS  combination  may  also  be  zero.  The 
problem  arises  when  the  number  of  leavers  is  equal  to  or  greater  than  one. 
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V  LOGISTIC 

[  1  ]  p  THIS  IS  THE  MAIN  FUNCTION  FOR  THE  REGRESSION  DIAGNOSTICS 

[  2  ]  p  AND  THE  VALIDATION .  THIS  FUNCTION  CALLS  THE  FUNCTIONS 

[ 3 ]  p  FITTED,  RESIDUAL  AND  VALIDATION  WHICH  THEY  ALL  MUST  BE 
[  4  ]  p  IN  THE  SAME  APL  WORKSPACE . 

[ 5 ]  FITTED 

[6]  RESIDUAL 

[7]  VALIDATION 

[8]  UPP+8 

[  9  ]  >  WOULD  YOU  LIKE  TO  SEE  RES ,  FITTED  VALUES  AND  BET  AH  AT ' 

[10] 

'0  t  NO     1  l  YES1 

[11. 

KK+U 

[12: 

+  (KK=0)/Lm 

[13] 

'BETAHAT  IS   ' 

[14] 

BETA 

[15! 

'  VECTOR  OF  FITTED  VALUES ' 

[16] 

TETHAT 

[17] 

•  VECTOR  OF  COMPONENTS  OF  DEVIANCE  IS ' 

[18] 

DEV 

[19] 

•  VECTOR  OF  COMPONENTS  OF  CHI-SQUARE  IS ' 

[20] 

CHICOM 

[21, 

'TOTAL  DEVIANCE  IS  r,*D 

[22] 

1  CHI-SQUARE  TEST  STATISTIC  IS   •  ,  3>CtfI 

[23] 

L14 :  '  JvWLL  YOU  LIKE  TO  SEE  THE  VALIDATION  RESULTS  ■ 

[24] 

'0  :M9  1  tYES' 

[25] 

MM«-D 

[26] 

■*(MM=0)/L15 

[27] 

'  CHI -SQUARE  MOE  FOR  THE  VALIDATION  • 

[28] 

1  1981   1982   1983' 

[29] 

CtflSG 

[30] 

•  DEGREES  OF  FREEDOM  IS   '  ,  3»£EF 

[31] 

t   i 

[32] 

LI 5  :  'WOULD  YOU  LIKE  TO  RUN  FOR  ANOTHER  CASE1 

[33] 

<0:NO     1  :  YES ' 

[34] 

TT+U 

[35] 

*(2T=0)/0 

[36] 

LOGISTIC 

V 

Figure  A. 7    Apl  Function  LOGISTIC. 

This  can  occurs  because  the  inventory  figures  refer  to  the  instant  beginning  of 
the  fiscal  year,  and  the  loss  figures  refer  to  any  time  during  the  year.  I.e.  an  officer  can 
both  access  and  attrite  from  it  any  time  during  the  year.  Then  p  ( =  y/n)  would  be 
ambigous  where  y  is  the  leavers  and  n  is  the  inventory  at  time  t. 

For  the  purpose  of  removing  this  ambiguity  from  the  data,  the  following 
policy  was  adopted  to  define  the  central  inventory  number  for  the  officer  force  at 
disaggregated  levels  for  any  cells  or  collection  of  cells. 

1.  Let  t  =   1 6,  refer  to  the  year  1977 1982 

2.  Let  Y(t)  =  Number  of  losses  in  year  t 

3.  Let  INV(t)  =  Inventory  in  the  beginning  of  year  t 
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[1] 

[2] 

[3J 

[4] 

[5] 

[6] 

C7] 

r  3 1 

[9] 

[10] 

mi 

ri2 1 

[13] 

C14] 

[15] 

[16] 

[17] 

[18] 

[19] 

[20] 

[21] 

[22] 

[23] 

[24] 

[251 

[26] 

[27] 

[28] 

[29] 

[30] 

[31] 

[32] 

[33] 

[34] 

[35] 

[36] 

V  FITTED 

p  this  function  is  for  the  calculation  of  the 
p  coefficiens, fitted  values  of  the  logistic 
r  regression. 
•  enter  mos ' 
mos+d 
'  enter  los ' 

LOS+U 
'  ENTER  GR  • 
GR+U 

INV1+IEST [ ( 1 +MOS ) : ( 1 +LOS ) : ( 1 +GR ) 1 
LOSSl<-£ESr[(l+M0S);  (1+LOS);  (1+GR)1 
K+a(.INVl) 
X*Su: 
X1*X 
EP-e-lE  8 
iVK-CK.l 
Y1«-CK,1 
e7-«-((,iVl 
XK-.7/X1 
Nl+J/Nl 
Yl+J/Yl 

BETA+((l+(cXl)) 
L2-.BETA1+BETA 
TETHAT+Xl+.xBETA 

S+Y1-N1*PHAT+ ( ( *TETHAT ) * ( 1 + (*TETHAT ) ) ) 
71-e  (iVl x  ( *TETHAT )  ) *  (  ( 1  +  ( *TETHAT )  ) * 2  ) 

/v«-p7«-,7i 


3,^)p(Xpl),(,^(((pG/?),(pL05))pL05)),(XpG/?)) 


'p(.LOSSl) 
i*0) 


DpO 


^(((^JV)plO)*(l/V)°.  =  (iW) 
BETA+BETA+ii  (l(((Wl  )  +  .x7 


x7)+.xXl))+.x($Xl))+.xS) 
R++S(BETA=BETA1') 
^L2xxEP<  |  /?  —  p  .BETA 
TETHAT+Xl+.xBETA 
I+(\N)°.=(\N) 

((  (7*0.5  )  +  .xXl)  +  .xr@(((Wl)  +  .x7)  +  .xXl))) 


Ml4-I-((S+.x(^Xl))+.x(7* 
MZH+/(((iiV)o.  =  (iJV))x/h) 

V 


7*0.5)) 


Figure  A. 8     Apl  function  FITTED. 

4.  Let  N(t)  =  Maximum  of  Y(t)  and  the  average  inventory  using  the  beginning 
inventory  in  year  t  and  t+1  and  computing  their  avarage  (INV(t) 
+  INV(t+ l))/2.  N(t)  is  the  central  inventory  of  year  t.  This  will  "provide  the 
elements  for  a  more  accurate  estimation  of 'the  attrition  rate  on  the 
disaggregated  level. 

3.        LOGISTIC  REGGRESSION  AND  VALIDATION  FUNCTIONS 

The  following  APL  functions  were  utilized  for  the  logistic  regression  and  the 
validation  of  the  model.  These  functions  must  be  in  the  same  APL  workspace.  Also, 
they  use  the  global  variables;  I  EST,  LEST,  IVALS1,  IVAL82,  IVAL83.  LVALS1, 
LVAL82  and  VAL83  which  are  the  output  of  the  function  GETDATA. 
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[1] 

[2] 

[3] 

[4] 

[5] 

[6] 

[7] 

[8] 

[9] 

[10 

[11 

[12 

[13 

[14 

[15 

[16 

[17] 

[18" 

[19 

[20 

[21 

[22 

[23 

[24 

[25 

[26 

[27 

[28] 

[29" 

[30 


V  RESIDUAL 
p  THIS  FUNCTION  IS  FOR  THE  CALCULATION  OF  THE 
o  RESIDUAL  VECTORS  OF  THE  REGRESSION. 
fl<-(,Yl*0)A((,Yl)*,/\/l) 
NH+H/ ,N1 
YH+H/ ,Y1 
Pl+YH+NH 

TETHA+®(P1*(1-P1)) 
TH+Hl.TETHAT 
DEV+    2*(TH-TETHA) 

DEVl+(pTETHAT )pH\DEV 

U<-,Yl-0 

NU+U/ ,N1 

PHAT.U+U /  ,PHAT 

A1  +    2*NUx(®M+(l-PHATU)) 

Al+(pTETHAT)pU\Al 

DEV2+DEV1+A1 

Z+(,Y1)=,N1 

NZ<rZ/  ,N1 

PHAT.Z+Z/  ,PHAT 

A2  +    2*NZ*(®PHATZ) 

A2+(pTETHAT)pZ\A2 

DEV+DEV2+A2 

£«-  +  /( \DEV) 

C1+DEV<0 

C2+DEV2Q 

DEV+(C2-C1)*(( \DEV)*0.5) 

TETA + ( p  TETHA T)pH\ TETHA 

VAR+NlxPHAT* ( 1 -PHAT ) 

CHI++/((S*2)*VAR) 

CHICOM+S*(VAR*0. 5 ) 
V 


Figure  A.9     Apl  Function  RESIDUAL. 

a.  Function  LOGISTIC 

APL  function  LOGISTIC  in  Figure  A. 7  is  the  main  function  for  the  regression 
and  validation  calculations.  This  function  calls  FITTED,  RESIDUAL  and  the 
VALIDATION  functions.  These  functions  cannot  be  run  alone.  They  must  be  run  by 
the  function  LOGISTIC.  In  other  words,  they  are  just  the  subfunctions  of  the  main 
function  LOGISTIC.  These  subfunctions  will  be  discussed  following. 

b.  Function  FITTED 

APL  function  FITTED  in  Figure  A. 8  finds  the  fitted  values  of  the  regression. 
This  function  uses  global  variables  "IEST"  and  "LEST". 

c.  Function  RESIDUAL 

APL  function  RESIDUAL  in  Figure  A.9  calculates  the  array  of  the  residuals. 
This  function  is  just  the  continuation  of  the  function  FITTED. 
filesect  Function  VALIDATION 
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[1] 

[2] 

[3] 

[4; 

C5! 

[6; 
[7; 

[8] 
[9" 


V  VALIDATION 
p  THIS  FUNCTION  IS  FOR  THE  CALCULATION  OF  THE 
fl  CHI -SQUARE  STAT.  (CHISQ  )  FOR  THE  FISCAL  YEARS 
n  FROM  1981  TO  1983. 
CHISQ+3pO 

INV2+IVAL81l(l+M0S){(l+L0S){(l+GR)l 

LOSS2+LVAL81l(l+MOS);  (1+LOS);  (,1+GR)1 

+L10 

m:INV2+IVAL82l(l+MOS):  (1+LOS):  (1+GR)1 


[10]  LOSS2+LVAL82t(l+MOS); (1+LOS); (1+GR)1 


[11 
[12 
[13 


[14]  L10:Tl<r(,lNV2*0) 


[15 

[16 

[17 

[18 

[19 

[20 

[21 

[22] 

[23" 

[24 

[25 

[26 

[27 


+L10 

L5:INV2+IVAL83L(l+MOS); (1+LOS); (1+GR)1 
LOSS2+LVAL83i(l+MOS);  (1+LOS);  (1+GR)1 


NT1+T1/ ,INV2 
IT1+T1/.LOSS2 
P+YT1+NT1 
P+(K,l)pTl\P 
N2+(k,l)p(iINV2) 
PHAT1+(K,1 )pj\ ( tPHAT) 
D+(PHAT1X0)/\(PHAT1*1) 
X+  +  /D 

CHISQ  [  J] «-+/  (  (  (PHAT1  -P)*2) x/V2  x£  ) *  (PHAT1  x  ( 1 -PHAT1  )  ) 
I+I  +  l 
+  (I=2)/m 
-KI=3)/L5 
DEF+N-3 
V 


Figure  A.  10    Apl  Function  VALIDATION. 

APL    function    VALIDATION    in    Figure    A.  10    calculates    the    Chi-Square 
statistics  for  the  fiscal  years  from  19S1  to   1983.  This  function  uses  global  variables 
IVALXX  and  LVALXX  where  "XX"  are  the  fiscal  years  from  19S1  to  1983. 
d.  Description  of  the  output  variables 

In  this  section,  we  will  describe  the  output  variables  which  are  used  in  the 
APL  functions. 

BETA       :  vector  of  the  regression  coefficients 

TETHA     :  vector  of  logit(p)  where  p  =  y/n 

TETHAT  :  vector  of  fitted  values 

DEV  :  vector  of  components  of  the  deviance 

CHICOM  :  vector  of  individual  components  of  x 

MD  :  vector  of  diagonal  elements  of  projection  matrix 

CHI  :  the  chi-squared  goodness  of  fit  statistic  for  estimation  years 

D  :  total  deviance 
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CHISQ      :  the  vector  of  chi-squared  test  statistic  for  validation  years 
DEF  :  degrees  of  freedom 
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APPENDIX  B 
GRAPHS 

This  appendix  contains  graphical  illustration  of  the  fitting  for  the  estimation  of 
USMC  officer  attrition  rates.  Some  cases  were  selected  from  the  USMC  manpower 
data  to  illustrate  whether  logistic  regression  model  fit  well  the  data  or  not.  Each  case 
has  its  own  regression.  From  Figure  B.l  through  the  Figure  B.8,  for  each  case, 
following  plots  are  showed. 

1.  logistic  probability  plot  of  components  of  the  deviance 

2.  logistic  probability  plot  of  components  of  the  chi-square 

3.  scatter  plot  of  fitted  values  vs  components  of  the  deviance 

4.  scatter  plot  of  fitted  values  vs  components  of  the  chi-square 
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Figure  B.l     Illustration  of  fitting  for  MOS  =  3,  LOS  =  0-6,  GR  =  4-6. 
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Figure  B.2     Illustration  of  fitting  for  MOS  =  7,  LOS  =  0-6,  GR  =  4-6. 
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Figure  B.3     Illustration  of  fitting  for  MOS  =   13,  LOS  =  0-6,  GR  =  4-6. 


46 


o 


00 
O 


a: 
< 

=) 
o 
oo 
j_ 
x 
a 

U_ 

o 

i/i 

t— 
z 


o 

CL 

? 

o 

u 

LO 

U- 

o 

VI 

n 

a: 

a 

o 

m 

VI 

o 

<tf 

LL 

o 

\— 

to 

00 

C3 

V 

O 

_J 

00 

o 

_l 

V 

u 
z 
< 

Si 

Q 


Z 

o 
a. 

O 
O 


o 

Q. 

m 

o 

0- 


1/1 

O 

o 


j_ j |__Ns*  j j f i_-i L i 

* 1 + — 1 l*"»j — 1 r~4 ! r 

J 1 x-.l I 1._*\. L..1 I L 

' 13 i--' i 1 ' '"' '■ 3 


C»    O)        <*~        *n        CN        «- 

31UN33H3d 


j\ 

► 

;    | 

~j — \ 

if)    O       if)       O       if>       O    IT) 

o  <j)     r--     m     cm     *- 

3~lliN30y3d 


UJ 

< 

O 
t/1 

J_ 

X 

o 

u. 
O 

00 

I— 
2 


z 

_L 

n 

X 

n 

o 

2 

o  u. 

n 

O 

u 

t/i 

(r» 

z 

> 

Ul 

z 
o 

7% 

00 

UJ 

n 
_: 
<r 

o 

> 

o 

I— 

3     a. 


< 

Q 

U. 

o 

00 

t— 
z 

UJ 

z 
o 

Q. 

O 

o 


I  5 

W  00 

O  3 

O    Q.  ' 

2  < 

8  > 


O 
_J 

CL 


</1 

UJ 

■   -J 

o 

LJ 

1= 


I  0  I-  Z- 

3HVn0S-IH0  30  SlN3N0dH00 


Z  I  o 

30NVIA3Q  30  S1N3N0JN00 


Figure  B.4     Illustration  of  fitting  for  MOS  =  20,  LOS  =  0-6,  GR  =  4-6. 
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Figure  B.5     Illustration  of  fitting  for  MOS  -  3,  LOS  =  19-29,  GR  =  7-9. 
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Figure  B.6     Illustration  of  fitting  for  MOS  =  7,  LOS  =   19-29,  GR  =  7-9. 
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Figure  B.7     Illustration  of  fitting  for  MOS  =   13,  LOS  =   19-29,  GR  =  7-9. 
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Figure  B.8     Illustration  of  fitting  for  MOS  =  20,  LOS  =   19-29,  GR  =  7-9. 
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